Automatic suggestion of phrasal-concept queries for literature search

نویسندگان

  • Youngho Kim
  • Jangwon Seo
  • W. Bruce Croft
  • David A. Smith
چکیده

Both general and domain-specific search engines have adopted query suggestion techniques to help users formulate effective queries. In the specific domain of literature search (e.g., finding academic papers), the initial queries are usually based on a draft paper or abstract, rather than short lists of keywords. In this paper, we investigate phrasal-concept query suggestions for literature search. These suggestions explicitly specify important phrasal concepts related to an initial detailed query. The merits of phrasal-concept query suggestions for this domain are their readability and retrieval effectiveness: (1) phrasal concepts are natural for academic authors because of their frequent use of terminology and subject-specific phrases and (2) academic papers describe their key ideas via these subject-specific phrases, and thus phrasal concepts can be used effectively to find those papers. We propose a novel phrasal-concept query suggestion technique that generates queries by identifying key phrasal-concepts from pseudo-labeled documents and combines them with related phrases. Our proposed technique is evaluated in terms of both user preference and retrieval effectiveness. We conduct user experiments to verify a preference for our approach, in comparison to baseline query suggestion methods, and demonstrate the effectiveness of the technique with retrieval experiments.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Statistical Features to Find Phrasal Terms in Text Collections

In this work we investigate alternatives to automatically detect phrasal terms, defined here as phrasal verbs, phrasal nouns, phrasal adjectives or phrasal adverbs found in a text. The automatic identification of phrasal terms may have several applications in text processing systems. We approach this problem and present a novel approach for detecting phrasal terms in a collection of documents. ...

متن کامل

A personalised query suggestion agent based on query-concept bipartite graphs and Concept Relation Trees

Queries submitted to a Web search engine are usually short and ambiguous. Currently, most search engines respond to a user’s query by using the bag-of-words model, which matches keywords between the query and Web documents but ignores contexts and users’ preferences. Thus, many irrelevant results are returned by the conventional search engines. Query suggestion is a way for extending queries to...

متن کامل

Abstractive Summarization of Spoken and Written Conversations Based on Phrasal Queries

We propose a novel abstractive querybased summarization system for conversations, where queries are defined as phrases reflecting a user information needs. We rank and extract the utterances in a conversation based on the overall content and the phrasal query information. We cluster the selected sentences based on their lexical similarity and aggregate the sentences in each cluster by means of ...

متن کامل

Query Suggestion by Concept Instantiation

A class of search queries which contain abstract concepts are studied in this paper. These queries cannot be correctly interpreted by traditional keyword-based search engines. This paper presents a simple framework that detects and instantiates the abstract concepts by their concrete entities or meanings to produce alternate queries that yield better search results. 3

متن کامل

Learning to Measure Quality of Queries for Automatic Query Suggestion

Users tend to use their own terms to search items in structured search systems such as restaurant searches (e.g. Yelp), but due to users’ lack of understanding on internal vocabulary and structures, they often fail to adequately search, which leads to unsatisfying search results. In this case, search systems should assist users to use different terms for better search results. To address this i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Process. Manage.

دوره 50  شماره 

صفحات  -

تاریخ انتشار 2014